CDS

Accession Number TCMCG024C07833
gbkey CDS
Protein Id XP_021978545.1
Location complement(join(118334541..118334677,118334781..118334830,118335017..118335096,118335185..118335340,118335483..118335723,118335813..118335925,118335999..118336132,118336230..118336443,118336600..118336980))
Gene LOC110873840
GeneID 110873840
Organism Helianthus annuus
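
The Location field above lists nine exon segments on the complementary strand. As a rough consistency check, the sketch below sums the segment lengths and compares the result with the protein length given in the record. The helper name `parse_segments` is illustrative and not part of any library, and the regular expression assumes plain `start..end` pairs with no fuzzy boundaries such as `<` or `>`.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(118334541..118334677,118334781..118334830,"
    "118335017..118335096,118335185..118335340,118335483..118335723,"
    "118335813..118335925,118335999..118336132,118336230..118336443,"
    "118336600..118336980))"
)


def parse_segments(location):
    """Return (start, end) pairs as 1-based inclusive coordinates.

    Hypothetical helper: handles only simple 'start..end' segments.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


segments = parse_segments(LOCATION)
on_minus_strand = LOCATION.startswith("complement(")

# Coordinates are inclusive, so each segment spans end - start + 1 bases.
cds_length = sum(end - start + 1 for start, end in segments)
codons, remainder = divmod(cds_length, 3)

print(len(segments), "segments, minus strand:", on_minus_strand)
print("CDS length:", cds_length, "nt")
print("Codons:", codons, "remainder:", remainder)
```

Under these assumptions the segments add up to 1506 nt, that is 502 codons: 501 amino acids plus one stop codon, in agreement with the Length field of the Protein section.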

Protein

Length 501 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA396063
db_source XM_022122853.2
Definition lysosomal Pro-X carboxypeptidase [Helianthus annuus]

EGGNOG-MAPPER Annotation

COG_category O
Description Lysosomal Pro-X
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001, ko01000, ko01002
KEGG_ko ko:K01285
EC 3.4.16.2
KEGG_Pathway ko04614, ko04974, map04614, map04974
GOs -

Sequence

CDS:  
ATGATATTCATGTTCCAATGGCTTCTTCTCTTCTTGGTCACGTTAACCCCGACAATCATAGGTGCACCCAACAAGACTCCACCACTCAGTCCGGTCAACCCGTCACTAAGAGGCCACACTGATGTTTTCGCGACAAGTTCTAATGACTTCGAAACTTTTTTCTACAACCAAACACTCGACCACTTCAACTTCAAGCCGGAAAGTTATGCCACTTTTCAACAACGGTATATTATCAACTCTAAATGGTGGGGTGGCGCGAGTAAGAATGCACCAATCTTGGTTTATCTTGGTGCCGAAGGACCCATAGACGATGATGTGACCGTCCTTGGGTTCCTCACTGAAAACGCCCCGCGTTTTAAGGCTCTTGTGGTCTTTTTGGAGCATCGTTTTTATGGAGAATCAAATCCGTTCGGGTTAAAAGGAGGATCGCAATCGATAGAGGCAATGGAAGAATCAGTGAAAAACAAGACTATTCGTGGGTATTTCAACTCGGCCCAAGCATTGGCGGATTACGCCGAGTTATTGCTTCATATCAAAAATAAGTTACATGCACATAATTCTCCTATTATTGTTATCGGAGGGTCTTATGGTGGAATGCTTGCATCATGGTTTCGTCTCAAGTATCCACATATCGCGCTTGGTGCTCTTGCTTCATCGGCTCCTATCCTTTACTTCGACGATTTAACCCCTCAAGATGGATATTATTCAATTGTTACCAAAGATTTCGAGGAAGCGAGCGAAAATTGTTACACGACTATAAAGCAATCATGGAACGAGATCGATAGAGTTGCTTCGATGCCAAATGGTCTTGCTATTCTCTCCCAAAAGTTCAACCTTTGCTCTCCTTTAAATAATGCTATGGAGCTCAAGAACTACTTAGACTCCACGTATGCAAGTGCAGCTCAATATAATGCCCCACCAAGGTATCCAACAACCCGAATTTGTCAAGGCATCGATGCCGCAAGCAATGACACTGATATACTTGATCGCGTATTCGCTGGTGTTGTTGCATATCAGCCTAATAGACCTTGCTACAACGTGACACCAGGCGTTACTCAAACTTCCATTGGATGGCAATGGCAAGTTTGTAGTGAGATGGTTATTCCTATAGGCATAACAAGTAATGTGAGCATGTTCCCTAGCTCGCCTTATGACGCAAAAGAATACGACGATGATTGTGACAAAATGTTTGGTGTTATGCCACGGCCTCATTGGGCTACAACATATTATGGCGGTCAGGACATAAGGATGATACTTAGCAAGTTTGGAAGTAACATCATCTTTTCTAATGGTTTAAGAGATCCATATAGCAGTGGAGGAGTGTTAGAAGATATTTCAGAGAACATACTTGCTGTGAAAACAACTAACGGGTCACATTGCTTGGACATATTAAAGTCGGTGGAGACTGATCCCGAATGGTTGGTTAAGCAGCGAAAAGACGAAGTGAAAATCATCAGTAGATGGTTTAGAAAATACTACCAAAATCTTCGTTTATTGAAACAATGA
Protein:  
MIFMFQWLLLFLVTLTPTIIGAPNKTPPLSPVNPSLRGHTDVFATSSNDFETFFYNQTLDHFNFKPESYATFQQRYIINSKWWGGASKNAPILVYLGAEGPIDDDVTVLGFLTENAPRFKALVVFLEHRFYGESNPFGLKGGSQSIEAMEESVKNKTIRGYFNSAQALADYAELLLHIKNKLHAHNSPIIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFDDLTPQDGYYSIVTKDFEEASENCYTTIKQSWNEIDRVASMPNGLAILSQKFNLCSPLNNAMELKNYLDSTYASAAQYNAPPRYPTTRICQGIDAASNDTDILDRVFAGVVAYQPNRPCYNVTPGVTQTSIGWQWQVCSEMVIPIGITSNVSMFPSSPYDAKEYDDDCDKMFGVMPRPHWATTYYGGQDIRMILSKFGSNIIFSNGLRDPYSSGGVLEDISENILAVKTTNGSHCLDILKSVETDPEWLVKQRKDEVKIISRWFRKYYQNLRLLKQ
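
The two sequences above should correspond under the standard genetic code. The following minimal sketch translates the first 39 and last 27 nucleotides of the CDS so the result can be compared with the two ends of the Protein sequence. The function `translate` and the variable names are illustrative. The sketch assumes the standard nuclear code and a CDS that starts in frame, and it is not a replacement for an established library such as Biopython.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons and last 9 codons of the CDS listed above.
cds_head = "ATGATATTCATGTTCCAATGGCTTCTTCTCTTCTTGGTC"
cds_tail = "CAAAATCTTCGTTTATTGAAACAATGA"

print(translate(cds_head))  # MIFMFQWLLLFLV
print(translate(cds_tail))  # QNLRLLKQ
```

Both fragments match the start (`MIFMFQWLLLFLV`) and the end (`QNLRLLKQ`) of the Protein sequence, and the final `TGA` codon is the stop codon that is not represented in the 501-residue protein.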